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Abstract. In the paper [25 J, written in collaboration with Gesine Reinert, we proved a uni- 
versality principle for the Gaussian Wiener chaos. In the present work, we aim at providing an 
original example of application of this principle in the framework of random matrix theory. More 
specifically by combining the result in |25| with some combinatorial estimates, we are able to 
prove multi-dimensional central limit theorems for the spectral moments (of arbitrary degrees) 
associated with random matrices with real-valued i.i.d. entries, satisfying some appropriate mo- 
ment conditions. Our approach has the advantage of yielding, without extra effort, bounds over 
classes of smooth (i.e., thrice differentiable) functions, and it allows to deal directly with discrete 
distributions. As a further application of our estimates, we provide a new "almost sure central 
limit theorem", involving logarithmic means of functions of vectors of traces. 
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1 Introduction 

1.1 Overview and main results 

In the paper [25J, written in collaboration with Gesine Reinert, we proved several univer- 
sality results, involving sequences of random vectors whose components have the form of 
finite homogeneous sums based on sequences of independent random variables. Roughly 
speaking, our main finding implied that, in order to study the normal approximations of 
homogeneous sums (and under suitable moment conditions) it is always possible to replace 
the original sequence with an i.i.d. Gaussian family. The power of this approach resides in 
the fact that homogeneous sums associated with Gaussian sequences are indeed elements of 
the so-called Wiener chaos, so that normal approximations can be established by means of 
the general techniques developed in [241 [27J [28] - that are based on a powerful interaction 
between standard Gaussian analysis, Malliavin calculus (see e.g. [26]) and Stein's method 
(see e.g. [9j). Moreover, in the process one always recovers uniform bounds over suitable 
classes of smooth functions. 

The aim of this paper is to introduce these techniques into the realm of random matrix 
theory. More specifically, our goal is to use the universality principles developed in [25], in 
order to prove the forthcoming Theorem ll.l| which consists in a multidimensional central 
limit theorem (CLT) for traces of non-Hermitian random matrices with i.i.d. real-valued 
entries. As explained below, the computations and estimates involved in the proof of 
Theorem 11.11 will be further applied in Section [5], where we will establish an almost sure 
central limit theorem (ASCLT) for logarithmic means associated with powers of large non- 
Hermitian random matrices. See Theorem 11.51 for a precise statement - as well as [18] for 
a general discussion on ASCLTs. 

Now let X be a centered real random variable, having unit variance and with finite 
moments of all orders, that is, E(X) = 0, E(X 2 ) = 1 and i?|X| n < oo for every n ^ 3. We 
consider a doubly indexed collection X = {Xij : i,j ^ 1} of i.i.d. copies of X. For every 
integer N ^ 2, we denote by Xn the N x N random matrix 



and by Tr(-) and X^, respectively, the usual trace operator and the kth power of X N . 




(1.1) 
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Theorem 1.1 Let the above notation prevail. Fix m ^ 1, as well as integers 

1 ^ kx < . . . < k m . 
Then, the following holds. 

(i) As N ->■ oo, 

(Tr(X^)-E[Tr(X^)],...,Tr(X^)-E[Tr(X^)]) ^ (Z kir .., Z km ) , (1.2) 

where Z = {Z^ : k ^ 1} denotes a collection of real independent centered Gaussian 
random variables such that, for every k ^ 1, E(Z%) = k. 

(ii) Write (5 = E\X\ 3 . Suppose that the function tp : W m — > R is thrice differentiable and 
that its partial derivatives up to the order three are bounded by some constant B < oo. 
Then, there exists a finite constant C = C(f3, B, m, k±, k m ), not depending on N, 
such that 



E 



Tr(X£) - E[rr(X%)) 



Var(Tr(X^)) 



TrjXfr) - E[Tr(X$r)]. 
yV^T^X^)) 



-E 



(1.3) 

^ CN- l ' A . 



Remark 1.2 1. We chose to state and prove Theorem II. II in the case of non-Hermitian 
matrices with real-valued entries, mainly in order to facilitate the connection with 
the universality results proved in [2S]. However, our techniques may be extended to 
the case where the random variable X is complex-valued and with finite absolute 
moments of every order. This line of research will be pursued elsewhere. One should 
also note that, differently from |30j, in the present paper we do not use any technique 
coming from complex analysis. 

2. Fix an integer K ^ 2 and assume that E\X\ 2K < oo, while higher moments are 
allowed to be possibly infinite. By inspection of the forthcoming proof of Theorem 
II. 1| one sees that the CLT (ll.2|) as well as the bound ( II. 3p continue to hold, as long 
as the integers k%, k m verify kj ^ K for j = 1, m. 

3. In a similar vein as at the previous point, by imposing adequate uniform bounds on 
moments one can easily adapt our techniques in order to deal with random matrices 
whose entries are independent but not identically distributed. One crucial fact sup- 
porting this claim is that the universality principles of Section [2] hold for collections 
of independent, and not necessarily identically distributed, random variables. 



3 



4. For non-Hermit ian matrices, limits of moments are not sufficient to provide an ex- 
haustive description of the limiting spectral measure or of the fluctuations around it. 
Rather, one would need to consider polynomials in the eigenvalues and their complex 
conjugates. These quantities cannot be represented using traces of powers of Xn, so 
that our approach cannot be extended to this case. 



1.2 Discussion 

In this section we compare our Theorem 11.11 with some related results proved in the existing 
probabilistic literature. 

1. In the paper |30j, Rider and Silverstein proved the following CLT. 

Theorem 1.3 Let X be a complex random variable such that E(X) = E(X 2 ) = 0, 
E(\X\ 2 ) = I, E(\X\ k ) ^ k ak , k ^ 3 (for some a > 0) and Re(X), Im(X) possess a 
joint bounded density. For N ^ 2, let X^ be defined as in M.l\) . Consider the space % of 
functions f : C — > C which are analytic in a neighborhood of the disk \z\ 4 and otherwise 
bounded. Then, as N — > oo, the random field 

{Tr(f(X N ))-E[Tr(f(X N ))] : / eH} 

converges in the sense of finite- dimensional distributions (f.d.d.) to the centered complex- 
valued Gaussian field {Z(f) : f G H}, whose covariance structure is given by 

E[Z(f)Z(g)] = [ f\z)W) — - (1-4) 

Here, U = {z G C : \z\ ^ 1} is the unit disk, and d 2 z/ir stands for the uniform measure on 
U (in other words, d 2 z = dxdy for x, y G M such that z = x + iy). 

By using the elementary relations: for every integers n, m ^ 0, 

I f z n 1 m d 2 z = f (n + I)' 1 if m = n 
it J v \ otherwise, 

one sees that our Theorem 11.11 can be reformulated by saying that 

{Tr(f(X N )) - E [Tr(f(X N ))) : / G Pol(C)} f -^4 {Z(f) : / G Pol(C)}, (1.5) 

where the covariance structure of {Z(f) : f G Pol(C)} is given by (jl ,4j) . It follows that 
Theorem 11.11 roughly agrees with Theorem 11.31 however, we stress that the framework of 
|30j is different from ours, since the findings therein cannot be applied to the real case due 
to the assumption that real and imaginary parts of entries must possess a joint bounded 
density. In addition, also note that (differently from |30| ) we do not introduce in the 
present paper any requirement on the absolute continuity of the law of the real random 
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variable X, so that the framework of our Theorem 11.11 contemplates every discrete random 
variable with values in a finite set and with unit variance. 



2. One should of course compare the results of this paper with the CLTs involving 
traces of Hermitian random matrices, like for instance Wigner random matrices. One 
general reference in this direction is the fundamental paper by Anderson and Zeitouni 
[3], where the authors obtain CLTs for traces associated with large classes of (symmetric) 
band matrix ensembles, using a version of the classical method of moments based on graph 
enumerations. It is plausible that some of the findings of the present paper could be also 
deduced from a suitable extension of the combinatorial devices introduced in [3] to the case 
of non-Hermitian matrices. However, proving Theorem 11.11 using this kind of techniques 
would require estimates for arbitrary joint moments of traces, whereas our approach merely 
requires the computation of variances and fourth moments. Also, the findings of [3] do not 
allow to directly deduce bounds such as (II. 3p . We refer the reader e.g. to Guionnet [T7] 
or to Anderson et al. [2], and the references therein, for a detailed overview of existing 
asymptotic results for large Hermitian random matrices. 

3. The general statement proved by Chatterjee in [HI Theorem 3.1] concerns the nor- 
mal approximation of linear statistics of random matrices that are possibly non-Hermitian. 
However, the techniques used by the author require that the entries can be re-written as 
smooth transformations of Gaussian random variables. In particular, the findings of [8j do 
not apply to discrete distributions. On the other hand, the results of [S] also provide uni- 
form bounds (based on Poincare-type inequalities and in the total variation distance) for 
one-dimensional CLTs. Here, we do not introduce any requirements on the absolute con- 
tinuity of the law of the real random variable X, and we get bounds for multi-dimensional 
CLTs. 



4. Let us denote by {Xj(N) : j = 1,...,N} the complex-valued (random) eigenvalues 



r - 

z k dfi XN (z) = J2^m k = MX k N ), N>2, k^l, (1.6) 

J .7=1 



of Ajv, repeated according to their multiplicities. Theorem 11.11 deals with the spectral 
moments of Xjy, that are defined by the relations: 

N 

N x 1 

3= 

where [xx N denote the spectral measure of X^. Recall that 
1 N 

where 5 Z (-) denotes the Dirac mass at z, and observe that one has also the alternate 
expression 

N 

Tr(X k N ) = N^ X ili2 X i2i3 ■ ■ ■ X ihh . (1.8) 

il,...,i k =l 
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It follows that our Theorem 11.11 can be seen as a partial (see Remark 11.21 (4) above) 
characterization of the Gaussian fluctuations associated with the so-called circular law, 
whose most general version has been recently proved by Tao and Vu: 

Theorem 1.4 (Circular law, see [35J) LetX be a complex-valued random variable, with 
mean zero and unit variance. For N ^ 2, let be defined as in M.l\) . Then, as N — )■ oo ; 
the spectral measure fix N converges almost surely to the uniform measure on the unit disk 
U = {z G C : \z\ ^ 1}. The convergence takes place in the sense of the vague topology. 

To see why Theorem 11.11 concerns fluctuations around the circular law, one can proceed 
as follows. First observe that, since E(X 2 ) = 1 and E(X 4 ) < oo by assumption, one can 
use a result by Bai and Yin |U Theorem 2.2] stating that, with probability one, 

limsup max \\j(N)\ < 1. (1.9) 

Now fix a polynomial p(z). Elementary considerations yield that, since (11.91) and the 
circular law are in order, with probability one 

±Tr(p(X N )) -+ - ( p(z)d 2 z = p(0). (1.10) 

M 7T J v 

On the other hand, it is not difficult to see that, for every k ^ 1 and as iV — > oo, 

1 



E 



z k dnx N {z) 



E 



:Tr(X£) 



->■ 



N' 

(one can use e.g. the same arguments exploited in the second part of the proof Proposition 



13. II below). This implies in particular, for every complex polynomial p 
E 



p(0) = - / p(z)d' 2 z. (1.11) 



7T 



By (jl.lOp and (11. lip , one has therefore that the quantities j^Tr(p(X]y)) and E (j^Ti (p(Xjsr))) 
both converge to p(0), and (jl.5p ensures that, for N sufficiently large, the difference 

Tr(p(X N )) - Np(0) - [E (Tr(p(X N ))) - Np(0)} 

has approximately a centered Gaussian distribution with variance - f v \p'(z)\ 2 d 2 z. Equiva- 
lently, one can say that the random variable -^Tt(p(Xn)) tends to concentrate around its 
mean as N goes to infinity, and (I1.5P describes the Gaussian fluctuations associated with 
this phenomenon. 

On the other hand, one crucial feature of the proof of the circular law provided in 
|35| is that it is based on a universality principle. This result basically states that, under 
adequate conditions, the distance between the spectral measures of (possibly perturbed) 
non-Hermitian matrices converges systematically to zero, so that Theorem 11.41 can be es- 
tablished by simply focussing on the case where X is complex Gaussian (this is the so-called 
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Ginibre matrix ensemble, first introduced in [IS]). It is interesting to note that our proof of 
Theorem II. II is also based on a universality result. Indeed, we shall show that the relevant 
part of the vector on the LHS of (11.21) (that is, the part not vanishing at infinity) has the 
form of a collection of homogeneous sums with fixed orders. This implies that the CLT 
in (II. 2p can be deduced from the results established in |25j . where it is proved that the 
Gaussian Wiener chaos has a universal character with respect to Gaussian approximations. 
Roughly speaking, this means that, in order to prove a CLT for a vector of general ho- 
mogeneous sums, it is sufficient to consider the case where the summands are built from 
an i.i.d. Gaussian sequence. This phenomenon can be seen as a further instance of the 
so-called Lindeberg invariance principle for probabilistic approximations, and stems from 
powerful approximation results by Rotar' [32] and Mossel et al. [23] . See the forthcoming 
Section [2] for precise statements. 

5. We finish this section by listing and discussing very briefly some other results related 
to Theorem 11.11 taken from the existing probabilistic literature. 

- In Rider [29] (but see also Forrester [ 13J ) , one can find a CLT for (possibly discon- 
tinuous) linear statistics of the eigenvalues associated with complex random matrices 
in the Ginibre ensemble. This partially builds on previous findings by Costin and 
Lebowitz [10] . 

- Reference [3 1 J , by Rider and Virag, provides further insights into limit theorems 
involving sequences in the complex Ginibre ensemble. In particular, one sees that re- 
laxing the assumption of analyticity on test functions yields a striking decomposition 
of the variance of the limiting noise, into the sum of a "bulk" and of a "boundary" 
term. Another finding in [31 J is an asymptotic characterization of characteristic 
polynomials, in terms of the so-called Gaussian free field. 

- Finally, one should note that the Gaussian sequence Z in Theorem 11.11 also appears 
when dealing with Gaussian fluctutations of vectors of traces associated with large, 
Haar-distributed unitary random matrices. See e.g. [ 1 1 J and [12J for two classic 
references on the subject. 

1.3 Proof of Theorem ll.lt the strategy 

In order to prove (ll.2p (and (ll.3p as well), we use an original combination of techniques, 
which are based both on the universality results of [25] and on combinatorial considerations. 
The aim of this section is to provide a brief outline of this strategy. 

For N ^ 1, write [N] = {1, N}. For k ^ 2, let us denote by D$ the collection of 
all vectors i = (ii, . . . G [N] k such that all pairs (i a ,i a+ i), a = 1, . . . , k, are different 
(with the convention that ik+i = that is, i G if and only if (i a ,i a +i) ^ {ib,ib+i) 
for every a ^ b. Now consider the representation given in (ll.8p and, after subtracting the 
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expectation, rewrite the resulting expression as follows: 
Tr(X^) - E [Tr(X k N )] 



N 

k 



N 2 ^2 (^1*2^213 ' ' ' X ikil - E[X hi2 X i2i3 ■ ■ ■ X ikh ]) (1-12) 

h,...,i k =l 

N 2 ^ ] X; Lr i 2 Xi 2 i 3 ■ ■ ■ Xi k i x 

+N 2 ^2 (Xi li2 Xi 2 i 3 • ■ ■ X ikil — E[X ili2 Xi 2i3 ■ ■ ■ X ikil ]). (1.13) 



Our proof of (II. 2p is based on the representation (I1.12p - (jl.l3p . and it is divided in two 
(almost independent) parts. 

I. In Section [31 we shall prove that the following multi-dimensional CLT takes place for 
every integers 2 ^ k\ < ... < k m : 

( 

i=l ieD (H) 

\ 

at-^t- Y Y Y 



V 



Law 



J 



for Z = {Zi : i ^ 1} as in Theorem 11.11 In order to prove (11. 14ft . we apply the univer- 
sality result obtained in [25] (and stated in a convenient form in the subsequent Section 
[2]). This result roughly states that, in order to show (11.1 4[) in full generality, it is sufficient 
to consider the special case where the collection X = {Xij : i,j ^ 1} is replaced by an 
i.i.d. centered Gaussian family G = {G?y : i,j ^ 1}, whose elements have unit variance. 
In this way, the components of the vector on the LHS of (I1.14p become elements of the 
so-called Gaussian Wiener chaos associated with G: it follows that one can establish the 
required CLT by using the general criteria for normal approximations on a fixed Wiener 
chaos, recently proved in [24] ^ZT\ [28]. Note that the results of [24, [27J |2B] can be described 
as a "simplified method of moments": in particular, the proof of (j!.14j) will require the mere 
computation of quantities having the same level of complexity of covariances and fourth 
moments. 

II. In Section HJ we shall prove that the term (11.131) vanishes as iV — > oo, that is, for 
every k ^ 2, 

R N (k) := Yl ( X ^X i2i3 ■ ■ ■ X ikh - E[X hi2 X i2i3 ■ ■ ■ X ikh }) -> in L 2 (fi). 

(1.15) 
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The proof of (I1.15P requires some subtle combinatorial analysis, that we will illustrate 
by means of graphical devices, known as diagrams. Some of the combinatorial arguments 
and ideas developed in SectionH]should be compared with the two works by Geman |14|I15|. 

Then, the upper bound (11.31) will be deduced in Section POl from the estimates obtained 
at the previous steps. 



1.4 An application to almost sure central limit theorems 

As already pointed out, one of the main advantages of our approach is that it yields explicit 
estimates for the normal approximation of vectors of traces of large random matrices - see 
e.g. relation (II. 3p . In Section \5\ we shall show that these estimates can be effectively 
used in order to deduce multivariate almost sure central limit theorems (ASCLTs), such as 
the one stated in the forthcoming Theorem 11.51 In particular, this result involves powers 
of non-Hermitian random matrices and sheds further light on the asymptotic behavior of 
their traces. To the best of our knowledge, Theorem 11.51 is the first ASCLT ever proved in 
the context of traces of random matrices. 



Theorem 1.5 Fix m ^ 1, as well as integers k m > . . . > k\ ^ 1, and let the Gaussian 
vector (Zfa, . . . , Z km ) be defined as in Theorem \l.l[ Then, a.s.-P , 



log 



I N 1 

7N E -^Mn)-E [T*(4*)] , • • • , Tr(Afr)-E [Tt(4rt] )^E[p{Z kl ,..., Z km )} . 



n=l 



(1.16) 

as N — > oo, for every continuous and bounded function if : W 71 — > R. 

Remark 1.6 1. Fix m ^ 1 and, for every N ^ 1, denote by the discrete random 
measure on M m assigning mass (nlog(iV)) -1 to the points 

(Tr(^) - E [Tt(A^)] , . . . , Tr(At) - E [Tr(A^)] ) , n = l,...,N. 

Then, the usual characterization of weak convergence imply that relation (I1.16P is 
indeed equivalent to saying that, a.s.-P, the measure pN converges weakly to the law 
of (Zk 1} • • • , Zk m ), as iV — > oo. For instance, by specializing (I1.16P to the case m = 1 
one obtains that, a.s.-P, 

1 N 1 

WjV E - 1 mA^)-E[Tr(A^ X } > P[Zk < x] , 

S n=l 

as — > oo, for every integer k ^ 1 and every real x. 
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2. The content of Theorem 11.51 should be compared with the following well-known AS- 
CLT for usual partial sums. Let (X n ) n ^x be a sequence of real-valued independent 
identically distributed random variables with E[X n ] = and E[X%\ = 1, and write 
S n = 77^^]fc = iAfc. Then, almost surely, for any bounded and continuous function 
Lp : R -HR, 



The asymptotic relation (j!.17j) was first stated by Levy [22] without proof, and then 
forgotten for almost fifty years. It was then rediscovered by Brosamler [7] and Schatte 
[33] and finally proved in its present form by Lacey and Philipp [21]. We refer the 
reader to Berkes and Csaki [6] for a universal ASCLT covering a large class of limit 
theorems for partial sums, extremes, empirical distribution functions and local times 
associated with independent random variables. The paper by Hormann [TH] contains 
several insights into the existing literature on the subject. 

3. As demonstrated in Section[5l in order to prove Theorem 1 1.51 we shall make a substan- 
tial use of a result by Ibragimov and Lifshits [19J, providing a criterion for ASCLTs 
such as ( I1.17p . not requiring that the random variables S n have the specific form 
of partial sums, nor that G is normally distributed. Our approach is close to the 
one developed by Bercu et al. [5], in the context of ASCLTs on the Wiener space. 
One should also note that [5J only deals with ASCLTs involving sequences of single 
real- valued random variables (and not vectors, as in the present paper). 

The rest of the paper is organized as follows. In Section |2] we present the universality 
results proved in [25], in a form which is convenient for our analysis. Section [3] contains 
a proof of f ll,14p . Section H] deals with (|1.15|) . whereas Section focuses on the proof of 
Theorem 11.51 

2 Main tool: universality of Wiener chaos 

In what follows, every random object is defined on an adequate common probability space 
(fl, P). The symbols E and 'Var' denote, respectively, the expectation and the variance 
associated with P. Also, given a finite set B, we write \B\ to indicate the cardinality of B. 
Finally, given numerical sequences aN,b^, N ^ 1, we write a n ~ &jv whenever ajv/fcjv — > 1 
as N — > oo. 

We shall now present a series of invariance principles and central limit theorems involv- 
ing sequences of homogeneous sums. These are mainly taken from [25] (Theorem I2.2p . |28] 
(Theorem 12. 4p and |27| (Theorem I2.6p . Note that the framework of [25J is that of random 
variables indexed by the set of positive integers. Since in this paper we mainly deal with 
random variables indexed by pairs of integers (i.e., matrix entries) we need to restate some 
of the findings of [25] in terms of random variables indexed by a general (fixed) discrete 
countable set A. 




71=1 



(1.17) 
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Definition 2.1 (Homogeneous sums) Fix an integer k ^ 2. Let Y = {Y a : a G A} 

be a collection of square integrable and centered independent random variables, and let 
/ : A k — > R be a symmetric function vanishing on diagonals (that is, /(ai, Ofc) = 
whenever there exists 7^ j such that = Oj), and assume that / has finite support. The 
random variable 

Q k {f,Y) = Yl f(ai,..,a k )Y ai ---Y ak = ^ k\f( ai ,...,a k )Y ai ---Y ak 

ai,...,a fc eA {a 1 ,...,a k }cA k 

(2.18) 

is called the homogeneous sum, of order k, based on / and Y. Clearly, E[Q k (f, Y)] = 
and also, if E(Y^) = 1 for every a G A, then 

£[Q fc (/,Y) 2 ] = fc!||/||L (2.19) 

where, here and for the rest of the paper, we set 



5^ f 2 (ai,... } a k ). 



Now let G = {G a : a G A} be a collection of i.i.d. centered Gaussian random variables 
with unit variance. We recall that, for every k and every /, the random variable Q k (f, G) 
(defined according to ( I2.18P ) is an element of the kth Wiener chaos associated with G. See 
e.g. Janson j20] for basic definitions and results on the Gaussian Wiener chaos. The next 
result, proved in [25J, shows that sequences of random variables of the type Q k (f, G) have 
a universal character with respect to normal approximations. The proof of Theorem 12.21 
is based on a powerful interaction between three techniques, namely: the Stein's method 
for probabilistic approximations (see e.g. [9]), the Malliavin calculus of variations (see e.g. 
|26|). and a general Lindeberg-type invariance principle recently proved by Mossel et al. 
in m\. 



Theorem 2.2 (Universality of Wiener chaos, see |25| ) Let G = {G a : a G A} be a 

collection of standard centered i.i.d. Gaussian random variables, and fix integers m ^ 1 and 
ki, k m ^ 2. For every j = 1, ...,m, let {/$ : N ^ 1} be a sequence of functions such that 
fjp : A kj — > R is symmetric and vanishes on diagonals. We also suppose that, for every 
j = l,...,m, the support of , denoted by supp(/$ ), is such that |supp(/^' ) )| — > 00, 
as N —t- 00. Define Q kj {f^\G) , N ^ 1, according to &2.18\) . Assume that, for every 
j = 1, ...m, the following sequence of variances is bounded: 

E[Q k .(f ( j\G) 2 ], N>1. (2.20) 

Let V be amxm non-negative symmetric matrix, and let ^4^(0, V) indicate a m- dimensional 
centered Gaussian vector with covariance matrix V. Then, as N — > 00, the following two 
conditions are equivalent. 
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(1) The vector {Q k , G) : j — 1, ...,m} converges in law to ^4^(0,1^) . 

(2) For every sequence X = {X a : a G A} of independent centered random variables, with 
unit variance and such that snp a E\X a \ 3 < oo, the law of the vector {(^-(/jv^X) : 
j = l,...,m} converges to the law of jV m {§, V). 

Note that Theorem 12.21 concerns only homogeneous sums of order k ^ 2: it is easily seen 
(see e.g. [25, Section 1.6.1]) that the statement is indeed false in the case k = 1. However, 
if one considers sums with a specific structure (basically, verifying some Lindeberg-type 
condition) one can embed sums of order one into the previous statement. A particular 
instance of this fact is made clear in the following statement, whose proof (combining the 
results of [25] with the main estimates of |23j) is standard and therefore omitted. 

Proposition 2.3 For m ^ 1, let the kernels : N ^ 1}, j = l,...,m, verify the 

assumptions of Theorem \2.2\ Let {a^ : i ^ 1} be an infinite subset of A, and assume 
that condition (1) in the statement of Theorem \2.2\ is verified. Then, for every sequence 
X = {X a : a G A} of independent centered random variables, with unit variance and such 
that sup a -EIXqI 3 < oo ; as N — )■ oo the law of the vector {Wn Qkj(fiP, X) : j = 1, m}, 
where Wn = 77^X^=1^0,; converges to the law of {N ; Nj : j = l,...,m} ; where N Q ~ 
^(0, 1), and (Ni, N m ) ~ ^4^(0, V) denotes a centered Gaussian vector with covariance 
V, and independent of N . 

Theorem 12.21 and Proposition 12.31 imply that, in order to prove a CLT involving vectors 
of homogeneous sums based on some independent sequence X, it suffices to replace X with 
an i.i.d. Gaussian sequence G. In this way, one obtains a sequence of random vectors 
whose components belong to a fixed Wiener chaos. We now present two results, showing 
that proving CLTs for this type of random variables can be a relatively easy task: indeed, 
one can apply some drastic simplification of the method of moments. The first statement 
deals with multi-dimensional CLTs and shows that, in a Gaussian Wiener chaos setting, 
componentwise convergence to Gaussian always implies joint convergence. See also [TJ for 
some connections with Stokes formula. 

Theorem 2.4 (Multidimensional CLTs on Wiener chaos, see [25, 28J) Let the fam- 
ily G = {G a : a G A} be i.i.d. centered standard Gaussian and, for j = l,...,m, define 
the sequences Qkjifw , G) ; N ^ 1, as in Theorem \2.2\ (in particular, the functions fjp 
verify the same assumptions as in that theorem). Suppose that, for every i,j = 1, ...,m, as 
N -> 00 

E[Q ki (f® ,G) x Q k .(f%\G)] -> V(i,j), (2.21) 

where V is a m x m covariance matrix. Finally, assume that Wn, N ^ 1, is a sequence of 
t/K(0, 1) random variables with the representation 

W N = J^wjv(a) x G a , 
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where the weights (a) are zero for all but a finite number of indices a, and YlaeA %( fl ) 2 = 
1. Then, the following are equivalent: 

(1) The random vector {Wn ', Q^if^i G) : j — 1, m} converges in law to {No ; Nj : 
j = 1, m}, where Nq ~ ^(O, 1), and (N\, ...,N m ) ~ ^m(0, V) denotes a centered 
Gaussian vector with covariance V , and independent of Nq . 

(2) For every fixed j = l,...,m, the sequence Qkjifff ;G) ; N ^ 1, converges in law to 
Z ~ ^(O, V(j, j)) , that is, to a centered Gaussian random variable with variance 
V(jJ). 

The previous statement implies that, in order to prove CLTs for vectors of homoge- 
neous sums, one can focus on the componentwise convergence of their (Gaussian) Wiener 
chaos counterpart. The forthcoming Theorem 12.61 shows that this type of one-dimensional 
convergence can be studied by focussing exclusively on fourth moments. To put this result 
into full use, we need some further definitions. 

Definition 2.5 Fix k ^ 2. Let / : A k — > R be a (not necessarily symmetric) function 
vanishing on diagonals and with finite support. For every r = 0, k, the contraction f* r f 
is the function on A 2d ~ 2r given by 

/★ r /(a 1( ...,a 2 d-2r) (2.22) 

= £ /(ai,... )/(Ofc-r+l> a 2 d-2r, ^1, X r ). 

(a:i,...,3v)eA r 

Observe that (even when / is symmetric) the contraction /* r / is not necessarily symmetric 
and not necessarily vanishes on diagonals. The canonical symmetrization of f* r f is written 

r*j. 

Theorem 2.6 (The simplified method of moments, see [27]) Fix k ^ 2. Let G = 

{G a : a G A} fre an i.i.d. centered standard Gaussian family. Let {f^ : N ^ 1} be a 
sequence of functions such that /jy : A k — > R zs symmetric and vanishes on diagonals. 
Suppose also that |supp(/jv)| — > oo, as N — >■ oo. Assume that 

E[Q k {fN, G) 2 ] -»■ ^ 2 > 0, as iV ^ oo. (2.23) 
Then, the following three conditions are equivalent, as N — >■ oo. 

(1) T/ie sequence Qk{fN, G), N ^ 1, converges in law to Z ~ ^(0, a 2 ). 

(2) E[g fc (/ W ,G) 4 ] ^3a 4 . 

(3) For every r = 1, fc - 1, ||/jv * r /jv||2A-2r -> 0. 



Finally, we present a version of Theorem 12.21 with bounds, that will lead to the proof 
of Theorem ll.U -(ii) provided in Section WM 
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Theorem 2.7 (Universal bounds, see |25| ) Let X = {X a : a E A} be a collection of 
independent centered random variables, with unit variance and such that (3 := sup a i?!^! 3 < 
oo. Fix integers m ^ 1, k m > ... > k\ ^ 2. For every j = 1, ...,m, let : A kj — > R be 
a symmetric function vanishing on diagonals. Define Q J (X) := Qfc.(/W,X) according to 
\2. 18\) . and assume that E[Qi(X.) 2 ] = 1 for all j = 1, . . . , m. Also, assume that K > is 
given such that ^] agj 4 maxx^j<g m Inf a (/ (J ')) < K, where 



E / 



'(i.(i2- ■ ■ ■ ,a kj ) 2 



{a 2 ,...,a k }CA i 



-1! ^ . 



(a,a 2 , ■ ■ . } a ki ) 2 . 



a 2 ,...,a k &A 



Let ip : 



be a thrice differentiate function such that ||</?"||oo + \\ i f h 



< oo, 



with ||v5 (fe) ||oo = max H=A . i sup zgRm \d a (p(z)\. Then, for Z = (Z 1 ,...,^" 1 ) ~ Jf m {$,l n 
(standard Gaussian vector on t m J, we have 



\E[ip(Q\X),...,Q m (X))}-E[cp(Z)]\^\\ip^(J2^ii + 2 E ^ 



J=l 



+K\\cp" 



(3 + 



^(16^/3)^- 
.i=i 



max rnaxInf a (/W) 

l^j^m aeA 



where Ay, 1 ^ « ^ j ^ m, zs given fry 



v \ 



r — 1 



+ k . - 2r)!(||/« * fcl _ r / (i) || 2r + ||/ (j) * fcj - r / (j) l| 2r ) 



We finish this section by a useful result, which shows how the influence Inf a / of / : 
A k — » R can be bounded by the norm of the contraction of / of order k — 1: 

Proposition 2.8 Let / : A k — > R 6e a symmetric function vanishing on diagonals. Then 
(k - 1)! maxlnf a (/) := max V] /(a, a 2 , • • • , a k ) 2 < ||/ * fc _i /|| 2 . 

aSA <r(=4 ^ — ' 



aeA 



<22,...,afc£:A 



Proof. We have 

ii/**-i/H2 = E 



a,beA 



E 

aeA 



E f(a,a2,---,a k )f(b,a 2 ,...,a k ) 



E / 2 ( a ' a 2 5 

.a2,...,a fe eA 



^ max 

aeA 



E f 2 (a,a 2 ,...,a k ) 

.<22,. ..,aj.£A 



n 2 



(fc-l)!maxlnf a (/) 

aeA 
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As a consequence of Theorem 12 . 71 and Proposition 12. S\ we immediately get the following 
result. 

Corollary 2.9 Let X = {X a : a E A} be a collection of independent centered random 
variables, with unit variance and such that (3 := sup a E\X a \ 3 < oo. Fix integers m ^ 1, 
k m > ... > k\ ^ 1. For every j = 1, ...,m, let '■ N ^ 1} be a sequence of functions such 
that fjp : A k] — > R is symmetric and vanishes on diagonals. Define Q-* N (X) := Qkjif^i 
according to Ii2.18\) . and assume that E[Q J N (X.) 2 ] = 1 for all j — 1, . . . ,m and N ^ 1. Let 
(p : W m ~R be a thrice differentiable function such that \\<p"\\oo + ||<£>'"||oo < oo. If, for 
some a > 0, \\fjp *k 3 -r II 2r — 0(N~ a ) for all j = 1, . . . ,m and r = 1, . . . , kj — 1, then, 
by noting (Z 1 , . . . , Z m ) a centered Gaussian vector such that E[Z l Z^} = if i ^ j and 
E[(Zi) 2 ] = 1, we have 

I^MQ^X), . . . , QJf(X))] - E[<p{Z\ Z m )} | = 0(N- a ' 2 ). 

3 Gaussian fluctuations of non-diagonal trace compo- 
nents 

Our aim in this section is to prove the multidimensional CLT (j!.14p . by using the uni- 
versality results presented in Section [2J To do this, we shall use an auxiliary collection 
G = {Gij : i,j^ 1} of i.i.d. copies of a ^(0, 1) random variable. 

As in Section 1X73]. for a given integer k ^ 2, we write to indicate the set of vectors 
i = (ii, . . . ,ik) G [N] k such that all the elements (i a ,i a+ i), a = 1, . . . , k, are different in 
pairs (with the convention that i k+ i = We have the following preliminary result: 

Proposition 3.1 For any fixed integer k ^ 2, 

N~ k ' 2 G ^2 ■ ■ ■ G ikil ±^Z k ~ jf(0, k) asN^oo. 

Remark 3.2 When k = 1, the conclusion of the above proposition continues to be true, 
since in this case we obviously have 

N 

N-V^Gu- ^(0,1). 

i=l 

Proof of Proposition \3.1\ The main idea is to use the results of Section [2J in the special 
case A = N 2 , that is, A is the collection of all pairs (i,j) such that i,j ^ 1. Observe that 

jy k/2 G ili2 . . . G ikil = Qk(fk,N, G), 
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with fk.N '■ ([N} 2 ) k — ► K the symmetric function defined by 



where we used the notation 
fk%(( a hbi), ■ ■ ■ , (a>k,h)) = N ~ k/2 ^2 1 fc(i)=«i:V(i)+i=^}-'' 1 {v(i i )=«i.vw+i^}' ( 3 - 25 ) 



and &k denotes the set of all permutations of [k]. Hence, by virtue of Theorem I2.6[ to 
prove Proposition 13 . 1 1 it is sufficient to accomplish the following two steps: (Step 1) prove 
that property (3) (with f^N replacing /jv) in the statement of Theorem 12.61 takes place, 
and (Step 2) show that relation (I2.23P (with f^jq replacing fjq) is verified. 

Step 1. Let r£{l,...,fc-l}. For a,r E &k, we compute 

fk%*r fk%{( X l>yi)' • • ■ ) (X2k-2r,y2k-2r)) (3.26) 
AT— k \ ^ -I -1 

iV / j - L {V(l)= 2: li i <T(l) + l=yi} • • • J -{i CT (fc_ r )=2:fc-r, ia(k-r) + l=Vk-r} 

X ^-{3T(l)= x k-r+l,3T(l) + l=yk-r + l} ' ' ' ^ {jr (k -r) =%2k~2r , j T (k-r) + l=V2k -2r } 

X l{V(fc-r + l)=jT(fc-r + l)> i<T(fc-T- + l) + l=jV(fc-r- + l) + l} ' ' ' ^ { V(fc) =Jt (fc) ' V(fc) + 1 =jr (fc) + 1 } ' 

We now want to assess the quantity Wfj^N^r /i^jv II Ifc— 2r-- T° do this, we exploit the represen- 
tation (I3.26P in order to write such a squared norm as a sum over ([iV] fc ) 4 : as a consequence, 

one deduces that ||/^* r /^||| fc _ 2r < l^jv ^l N~ 2k where F^' a ' T) is the subset of ([N} k ) 4 
composed of those quadruplets (i,j,a, b) such that 

2o-(l) = flo-(l), V(l)+1 = a a(l)+l, • • • j ia(k-r) = G<r(fc— r)j V(fc-r)+l = a<r(fc-r)+l 
jr(l) = &t(1)> Jr(l)+1 = &r(l)+l> • • • ) jr(k-r) = K(k-r), jr(k-r)+l = K{k-r)+l 

r+1) = J-r(fc-r+l)> 2cr(fc-r+l)+l = Jr(fe— r+l)+l) ■ ■ ■ > l a(k) = Jr(k)i i a(k)+l = Jr(fe)+1 
a cr(k-r+l) — W(k-r+l)i a cr(k-r+l)+l = b T (k-r+l)+l, ■ ■ ■ > 0<r(fc) = ^r(fc), a o-(fc)+l = O r (k)+l- 

(3.27) 

It is immediate that, among the equalities in (13.271) . the 2k equalities appearing in the 
forthcoming display (13.28)) are pairwise disjoint (that is, an index appearing in one of the 
equalities does not enter into the others): 

— a cr(l), ■ ■ ■ , i<r{k-r) — a a{k-r)i jr(l) = K(l), ■ ■ ■ , jr(k-r) = K(k-r) 
V(fe-r+l) — 3r(k-r+l), ■ ■ ■ , V(fc) = Jr(fe)) ^(fc-r+l) = K(k~r+l), ■ ■ ■ , 0>tr(k) = K(k)- 

(3.28) 

Hence, the cardinality of F^'^ is less than N 2h , from which we infer that #*r/fc jvllifc-2r 
is bounded by 1. This is not sufficient for our purposes, since we need to show that 
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Il/jfcjv * r /fc^Af II Ifc— 2r tends to zero as iV — > oo. To prove this, it is sufficient to extract 
from (I3.27P one supplementary equality which is not already written in ()3.28|) . We shall 
prove that this equality exists by contradiction. Set L = {<j(s) : 1 ^ s ^ k — r} and 
R = {(t(s) + 1: 1 ^ s ^ k — r} (with the convention that k + 1 = 1). Now assume that 
R = L. Then cr(l) + 1 G R also belongs to L, so that er(l) + 2 6 £ By repeating this 
argument, we get that L = R = [k], which is a contradiction because r ^ 1. Hence, R ^ L. 
In particular, the display (13.271) implies at least one relation involving two indices that are 
not already coupled in (j3.28| . This yields that the cardinality of F^' a ' T ^ is at most iV 2fc_1 , 
and consequently that \\fjfjlf * r II lfc--2r- ^ This fact implies immediately that the 

norms \\f k>N * r f k ,Nhk-2r, r = 1, . . . , k - 1, verify 

\\fk,N*r fk,Nhk-2r = O^ 1 / 2 ), (3.29) 

and tend to zero as N — > oo. In other words, we have proved that condition (3) in the 
statement of Theorem 12.61 is met. 

Step 2. We have 



Var 



/ \ 



N ^ ] ^[^1112 • • • Gikh^jlj2 ■ ■ ■ Gj k ji\ 



For fixed i, j G observe that the expectation E\Gi x i 2 ■ ■ ■ Gi k i 1 Gj 1 j 2 . . . Gj k j 1 ] can only 

be zero or one. Moreover, it is one if and only if, for all s£ [k], there is exactly one t G [k] 
such that (i s ,i s+ i) = (jt,jt+i)- In this case, we define a G &k as the bijection of [k] into 
itself which maps each s to the corresponding t and we have, for all s G [k], 

is = ja(s) = Ms-1)+1- (3-30) 
To summarize, one has that Var (^N~ k / 2 X] i6Z) ( fc ) G^i 2 • • • ^MiJ equals 

N ~ k E K^'j) e : = ti*W,Ms)+i) for all s G [A:]}|. (3.31) 

If a G &k is such that a(s) = a(s — 1) + 1 for all s (it is easily seen that there are exactly k 
permutations verifying this property in &k), we get k different conditions by letting s run 
over [k] in f)3.30p . so that 

{(i,j) G {Dff : (i s ,i s+1 ) = (Ms),Ms)+i) for all s G [k]} = A^ + O^" 1 ), as N 00. 

In contrast, if a G &k is not such that a(s) = a(s — 1) + 1 for all s, then by letting s run 
over [k], one deduces from (I3.3(jp at least k + 1 different conditions, so that, in this case, 

{(i,j) G (Z^) 2 : (z 8 ,i, +1 ) = (m s) ,Ms) + i) for all s G [fc]} = O^ 1 ), as N -> 00. 
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Taking into account these two properties together with the representation (I3.3ip . we deduce 
that the variance of 

n k/2 ^2 ^ hi2 ■ ■ ■ 

tends to k as N — > oo. It follows that the required property f!2.23j) in Theorem 12.61 (with 
a 2 = k) is met. 

The proof of Proposition 13.11 is concluded. □ 



Remark 3.3 By inspection of the previous proof, one also deduces that, for every k ^ 2, 
there exists a constant Ck (independent of N) such that, for all iV ^ 1, 

/ \ 



Var 



\ 



ieD 



(k) 



k 



J 



TV 



(3.32) 



The multidimensional version of Proposition 13.11 reads as follows: 

Proposition 3.4 Fix m ^ 1, as well as integers k m > . . . > ki ^ 2. Then, as N — >■ oo ; 
/ 

N-^Y^Gu, IT* G hi2 ■ ■ ■ G ikih , . . . (3.33) 



V 



i=l 



(fci) 



N-^ V G G ■ 



ieD 



(k m ) 



Law 



(Zl, Zfa, Zk m ) ■ 



J 



where Z = {Zk '■ k ^ 1} denotes a collection of independent centered Gaussian random 
variables such that, for every k ^ 1, E(Zl) = k. 



Proof. It is an application of Theorem I2.4[ in the following special case: 

- WN(i,j) = if i = J ^ N and ifjv(«, j) = otherwise; 

- V is equal to the diagonal matrix such that V(a, b) = if a ^ b and V(a,a) = k a , 
for a = 1, m; 

- for j = 1, m, fjp = fk ,Nj where we used the notation (I3.24p . 



Indeed, in view of Proposition 13. 1[ one has that condition (2) in the statement of Theorem 
I2.4l is satisfied. Moreover, for fixed a ^ b and since G consists of a collection of independent 
and centered (Gaussian) random variables, it is clear that, for all N, 



E 



Gi x i 2 . . . Gi ka i x x Gj 1 j 2 . . . Gj kb j 1 



(ka) 



so that condition (I2.2ip is met. The proof is concluded. 
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By combining Proposition 13.41 and Proposition 12. 3) we can finally deduce the following 
general result for non-diagonal trace components. 

Corollary 3.5 For N ^ 2, let be the N x N random matrix given by ( fj.jp . where 
the reference random variable X has mean zero, unit variance and finite absolute third 
moment. Fix m ^ 1, as well as integers 2 ^ k\ < . . . < k m . Then, the CLT ft!.14\) 



takes place, with Z = {Zk : k ^ 1} denoting a sequence of independent centered Gaussian 
random variables such that, for every k ^ \, E(Z%) = k. 



Remark 3.6 In order to prove Corollary 13. 5) one only needs the existence of third mo- 
ments. Note that, as will become clear in the following Section HJ moments of higher orders 
are necessary for our proof of ( II . 15)) . 



The remainder: combinatorial bounds on partitioned 
chains and proof of Theorem 11.11 



Fix an integer k ^ 2. From section 11.3) recall that D N denotes the subset of vectors 
i = (zi, . . . ,ik) G [N] k such that all the elements (i a > Wi)> a — 1> ■ ■ ■ > k, are different in 
pairs (with the convention that i^+i = i{). From the Introduction, recall that X is a 
centered random variable, having unit variance and with finite moments of all orders. Let 
also X = {Xij : i,j ^ 1} be a collection of i.i.d. copies of X. In the present section, our 
aim is to prove the asymptotic relation (jl .15)) , that is 



Proposition 4.1 For every k ^ 2, as N — )■ oo ; 



E(R N (kf) = Var 



N ^ ^ ] [-i^Qii2 • • • -^ikh E{Xi^i 2 . . . X^ k i x 
\ ^ ) 



OiN- 1 ). (4.34) 



The proof of Proposition 14. 1 1 is detailed in Section fl~4") and builds on several combinatorial 
estimates derived in Sections I4.2H4.31 To ease the reading of the forthcoming material, we 
now provide an intuitive outline of this proof. 

Remark on notation. Given an integer k ^ 2, we denote by V(k) the collection of all 
partitions of [k] = {1, ...,&}. Recall that a partition 7r g V(k) is an object of the type 
7i = {Bi, ...,B r }, where the 5j's are disjoint and non-empty subsets of [k], called blocks, 

such that Uj=i r Bj = [k). Given a, x G [k] and n G V(k), we write a ~ x whenever a and 

x are in the same block of tt. We also use the symbol 1 to indicate the one-block partition 
1 = {[k]} (this is standard notation from combinatorics - see e.g. [34J). In this section, for 
the sake of simplicity and because k is fixed, we write instead of D^. 
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4.1 Sketch of the proof of Proposition |47T] 

Our starting point is the following elementary decomposition: 

[N] k \D N = |J A n (tt), 

where Q(k) stands for the collection of all partitions of [k] containing at least one block 
of cardinality ^ 2, and An(it) is the collection of all vectors i G [N] k such that the 
equality (i a ,i a +i) — iix^x+i) holds if and only if a ~ x. Using this decomposition, one 
sees immediately that, in order to show ()4.34j) . it is sufficient to prove that, for each fixed 
7T G Q(k), the quantity 

Var(iV- fc / 2 [X ili2 ...X ikil -E(X hi2 ...X ihh )]\ (4.35) 

= N k [ E(X hi2 . . . X ikil X jlj2 . . . X jkjl ) - E(X hi2 . . . X ikil )E(X jlj2 . . . X jkjl ) } 

(ij)eA JV ( 7 r)xA JV (7r) 

is 0(A^ _1 ), as iV — > oo. Let Gn(tt) denote the subset of pairs (i, j) G A^(tc) x A^(tt) such 
that the following non- vanishing condition is in order: 

E(X hi2 . . . X ikh X jlj2 . . . X jkjl ) - E[X Wl2 . . . X ikil )E(X jlj2 . . . X jkjl ) ^ 0. (4.36) 

Hence 

VarliV^/ 2 J2 [X hi2 ...X ikh -E(X hi2 ...X ikil )]\ (4.37) 

\ ieAjv(Tr) J 

— X ' ^ \E(Xi x i 2 . . . Xi k i 1 Xj 1 j 2 . . . Xj k jA — E{Xi x i 2 . . . Xi k i 1 )E(Xj 1 j 2 . . . Xj k j L ) ] . 
(ij)eG;v(0 

Due to the finite moment assumptions for X, and by appling the generalized Holder in- 
equality, it is clear that, for a generic pair (i, j), 

\E(X hi2 . . . X ikil X jlj2 . . . X jkjl )-E(X hi2 . . . X ikil )E(X jlj2 . . . X jkjl )\ ^ 2 E(\X\ 2 ) < oo. 

It follows that, in order to prove that the sum in (I4.37P is 0(A^ _1 ), it is enough to show 
that 

\G N {ir)\ ^ Q{k,7r)N k -\ (4.38) 

for some constant Q(k, it) not depending on N. Our way of proving (I4.38P is to show that, 
if (i, j) denotes a generic element of Gn{^), then, necessarily, there exists at least k + 1 
equalities between the 2k indices ii, . . . ,ik, ji, ■ ■ ■ ,jk of (i,j). Note that by 'equality' we 
just mean the existence of two different integers a, b G [k] such that i a = or j a = jb, or 
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the existence of two integers a, b G [k] such that i a = jt>. Proving this fact implies that 
the 2k indices of a generic elements (i, j) of G^(jr) have at most k — 1 degrees of freedom 
(see Point 7 of Section 4.2 for a precise definition), so that (I4.38j) holds immediately — the 
constant Q(k, ir) merely counting the number of ways in which the k + 1 equalities can be 
consistently distributed among the indices composing (i, j). In order to extract these k + 1 
equalities between the 2k indices of a generic element (i, j) of Gn(tt), we will consider two 
cases, according as the partition 7r G Q(k) contains at least one singleton or not. 

Case A: No singletons in it. By definition of An(tt), and due to the absence of singleton 
in 7r, we already see that there are at least k/2 or (k + l)/2 (according to the evenness of 
k) equalities between the k indices of i (resp. j). Moreover, the non- vanishing condition 
(I4.36P implies that there is at least one further equality between one index of i and one 
index of j. So, we proved the existence of k + 1 equalities between the 2k indices of (i, j), 
and the proof of (I4.38[) in the Case A is done. 

Case B: At least one singleton in tt. Let S denote the collection of the singleton(s) of n. In 
order for (I4.36P to be true, observe that, for all s G S, we must have (j s ,j s +i) — (^<n Wi) 
for some a G [k]. In particular, this means that there exist \S\ equalities of the type j s = i a 
for the indices composing (i, j). Also, by definition of the objects we are dealing with, for 
all t G [k] \ S, we must have (i t ,it+i) = (i a ,i a +i) for some a, different from t, in the same 
7r-block as t. Of course, the same must hold with % replaced by j. Hence, in order for (I4.38P 
to be true, it remains to produce one equality between indices that has not been already 
considered. We mentioned above that for all t G [k] \ S, there exists a, different from t and 
in the same block as t, such that j t = j a - Hence, to conclude it remains to show that we 
have j t = j a for at least one integer t belonging to [k] \ S and one integer a not belonging 
to the same block as t. Since, by assumption, tc contains at least one singleton and one 
block of cardinality ^ 2 (indeed, tt G Q(k)), without loss of generality (up to relabeling 
the indices according to a cyclic permutation of [k]), we can assume that 5* contains the 
singleton {k}. Consider now the singleton {s*} of S, where s* is defined as the greatest of 
the integers m such that {m} is adjacent from the right to a block, say B u *, of cardinality 
^ 2. For a particular example of this situation, see the diagram in Fig. HJ where each row 
represents the same partition of [7] having s* = 6 (see Point 3. in the subsequent Section 
14.21 for a formal construction of diagrams). To finish the proof, once again we split it into 
two cases: 

Case Bl: The block B u * contains two consecutive integers. This assumption implies that 
jx — jt — jt+i f° r ah x,t E B w . Since {a} is adjacent from the right to B UJf , we have 
ja — jt f° r all t G which is exactly what we wanted to show. 

Case B2: The block B m does not contain two consecutive integers. Fig. [7] is an illus- 
trative example of such situation, where each row represents the same partition of [8], 
with s* = 7. As we see on this picture, we have necessarily jj = j'5, yielding the desired 
additional equality, which could not be extracted from the previous discussion. In Section 
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14. 'S\ it is shown that this line of reasoning can be extended to general situations. 

Remark 4.2 The sketch given above contains all the main ideas entering in the proof 
of Proposition 14.11 The reader not interested in technical combinatorial details, can then 
go directly to Section 14. 4[ where the proof of Theorem 11.11 is concluded. The subsequent 
Sections I4.2H4.3I fill the gaps of the above sketch, by providing exact definitions as well as 
complete formal arguments leading to the estimate ( I4.34p . 

4.2 Definitions 

In the following list, we introduce some further definitions that are needed for the analysis 
developed in the rest of this section. 

1. Fix integers N, k ^ 2. A chain c of length 2k, built from [N], is an object given 
by the juxtaposition of 2k pairs of integers of the type 

C = («i , Z 2 ) («2 , *3) ■ ■ ■ («fe , * 1 ) ( Jl , J2 ) (j2 , J3 ) ■ - • ( Jfc , jl), (4-39) 

where i a ,j x G [N], for a, x — 1, k. The class of all chains of length 2k built from [N] is 
denoted by C(2k, N). As a notational convention, we will use the letter i to write the first 
k pairs in the chain, and the letter j to write the remaining ones. For instance, an element 
of C(6, 5) (that is, a chain of length 6 built from the set {1, 2, 3, 4, 5}) is 

c=(l,5)(5,l)(l,l)(3,3)(3,3)(3,3), 

where %\ = 1, i 2 = 5, i% = 1, j\ = j 2 = j% = 3. According to the graphical conventions 
given below (at Point 3 of the present list) we will sometimes say that (ii , ^2) (*2 ; 2*3). ..(ifc, i\) 
and (ji, J2)(j2, h) ■■■(jk,ji) are, respectively, the upper sub-chain and the lower sub-chain 
associated with the chain c in f)4.39p . For instance, in the previous example the upper 
sub-chain is (1, 5)(5, 1)(1, 1), whereas the lower one is (3, 3)(3, 3)(3, 3). We shall say that 
(ii, ii+i) is the Zth pair in the upper sub-chain of c (and similarly for the elements of the 
lower sub-chain). We shall sometimes call i a the left index of the pair (i a ,i a +i)- Also, we 
use the convention i^+i = %\ and jk+i = ji- Of course, a chain is completely determined 
by the left indices of its pairs. 

2. Let 7r G V(k) be a partition of [k]. Recall that, for a, b G [k], we write a ~ b to 
indicate that a and b belong to the same block of ir. We say that a chain c as in ( I4.39P 
has partition it if, for every a, b G [k], the following double implications take place: (i) 
(i a , i a+1 ) = (i b , i b+ i) if and only if a ~ b, and (ii) (j a ,j a +i) = {jb,jb+i) if and only if a ~ b. 
In other words, a chain has partition ir if and only if the partitions of [k] induced by the 
identical pairs in its upper and lower sub-chain are both equal to ir, that is (with the 
notation of Section 14. ip . if and only if ...,ik), (ji, ■•■■,3k) G A^^). For instance, take 
k = 4 and 7r = {{1, 3}, {2, 4}}. Then, the following chain built from [3] has partition tt: 

c= (1,2)(2,1)(1,2)(2,1)(3,1)(1,3)(3,1)(1,3). 
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Note the 'only if part in the definition given above, implying that, if a chain has partition 
7r and if x and y are not in the same block of tt, then necessarily (i x ,i x +i) 7^ (h^y+i) 
and (jx,jx+i) 7^ {jy-ijy+i)- This yields in particular that a chain cannot have two different 
partitions. 

3. Given k ^ 2, we shall sometimes represent a generic chain with partition tt G V(k) 
by means of diagrams. These diagrams are mnemonic devices composed of an upper row 
and a lower row, of k dots each. These rows represent, respectively, the upper and lower 
sub-chain of a given chain, in such a way that the Ith dot (from left to right) in the upper 
(resp. lower) row corresponds the /th pair in the upper (resp. lower) sub-chain. Each block 
B of the partition tt is represented by two closed curves: the first one is drawn around 
the dots of the upper row corresponding to the pairs (i a ,i a +i) verifying a6 5; the second 
one is drawn around the dots of the lower row corresponding to those {j x ,jx+i) verifying 
x G B. The resulting diagram is the superposition of two identical combinations of dots 
and curves. Note that the shape of the diagram does not depend on N. For instance, the 
diagram in Fig. [1] corresponds to the case k = 7, and tt = {{1,4,5}, {2}, {3}, {6}, {7}}^ 
whereas the diagram in Fig. [2] corresponds to k — 6 and the one-block partition 1 = {[6]}. 




Figure 1: a chain with a five-block partition 




Figure 2: a chain with a one-block partition 

4. In general, given a chain c as in (I4.39P with partition tt = {B\, ...,B r } as at Point 2 
of the present list, we shall say the the block B u of the upper sub-chain corresponds to 
the block B v of the lower sub-chain, whenever (i a ,i a +i) — {jx,jx+i) f° r every a G B u and 

3 A chain with partition it as in Fig. [T]is 

c - (1, 1)(1,2)(2, 1)(1, 1)(1, 1)(1,3)(3, 1)(1, 1)(1,4)(4, 1)(1, 1)(1, 1)(1, 5)(5, 1). 
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every x G B v . Note that one given block B u in the upper sub-chain cannot correspond to 
more than one block in the lower sub-chain. For ix = {Bi, ...,B r } G V(k), we shall now 
define a class of chains C 7T (2k,N) C C(2k,N), whose elements have partition tt and are 
characterized by two facts: the associated upper and lower sub-chains have at least one 
pair in common, and "no singletons are left on their own". Formally, the class C 7r (2k,N) 
is defined as follows (recall that we use the letter % for the elements of the upper sub- 
chain, and the letter j for the elements of the lower sub-chain), (i) If \B t \ ^ 2 for every 
t — 1, r, then C n (2k, N) is the collection of all chains of partition it verifying that there 
exists a,x G [k] such that the block B a in the upper sub-chain corresponds to the block 
B x in the lower sub-chain, (ii) If tc contains at least one singleton, then C 7r (2k,N) is the 
collection of all chains of partition n such that every singleton in the upper (resp. lower) 
sub-chain corresponds to a block of the lower (resp. upper) subchain, that is: for every 
{a} G tt, there exists u — 1, ...,r such that (i a ,i a +i) — (jhji+i) f° r every I G B u , and, for 
every {x} G tt, there exists v = l,...,r such that (J x ,jx+i) = (js,js+i) f° r every s G B v . 
For instance, if k = 3 and tt = {[3]}, then one element of CV(6, 5) is 

c=(5,5)(5,5)(5,5)(5,5)(5,5)(5,5). 

If k = 6 and tt = {{1, 2, 3}, {4}, {5}, {6}}, then one element of C w (12, 5) is 

c=(l,l)(l,l)(l,l)(l,2)(2,5)(5,l)(2,2)(2,2)(2,2)(2,5)(5,l)(l,2). 

5. Fix k,N ^ 2, as well as a partition tt = {B\, B r } G V(k). Given two subsets 
U, V C [r] such that \U\ = \V\, let R : U 4 ^ : ti 4 be a bijection from [/ 

onto V. We shall denote by C^(2k,N) the subset of C w (2k,N) composed of those chains 
c G C n (2k, N) such that the block £> u in the upper sub-chain corresponds to the block 
.Br( u ) in the lower sub-chain. When U = {u} and V = {v} are singletons, we shall simply 
write C%' v (2k, N) to indicate the set of those c G C 7r (2k,N) such that the block B u in 
the upper sub-chain corresponds to the block B v in the lower sub-chain. For instance, the 
chain 

Cl = (1, 1)(1, 1)(1, 2)(2, 5)(5, 1)(2, 2)(2, 2)(2, 5)(5, 1)(1, 2) 

is an element of C^(10, 4), where tt = {B t , B 2 , B 3 , B A } = {{1, 2}, {3}, {4}, {5}}, U — V — 
{2, 3, 4}, and R(2) = 4, i?(3) = 2 and i?(4) = 3. The chain 

c 2 = (3,3)(3,3)(3,3)(3,3) 

belongs to C i 1 ' 1 (4,3), where 1 = {B ± } = {[2]}. Note that the definition of C^(2k, N) does 
not give any information concerning the blocks of the upper and lower sub-chains that do 
not belong, respectively, to the domain and the image of R. In other words, for a chain 
c G C^(2k, N), one can have that the block B u in the upper sub-chain corresponds to the 
block B v in the lower sub-chain even if u ^ U and v ^ V. For instance, the chain 

c = (1, 1)(1, 1)(1,2)(2,5)(5, 1)(1, 1)(1, 1)(1,2)(2,5)(5, 1) 
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is counted as an element of (7^(10,4), where 

vr = {B u B 2 , B 3 , B 4 } = {{1, 2}, {3}, {4}, {5}}, 
U = V = {2, 3, 4}, and R{u) = u, for u = 2, 3, 4. 

6. Fix fc, iV 2, as well as a partition 7r = {-Bi, B r } G V(k). Given a bijection 
R : U — > V as at Point 5 above, we shall represent a generic element of the class C„(2k, N) 
by means of a diagram built as follows: first (i) draw the diagram associated with the class 
C n (2k, N), as explained at Point 3 of the present list, then (ii) for every pair of blocks B u 
and B v such that u G U, v G V and v = R(u) (note that B u is in the upper sub-chain, and 
B v in the lower sub-chain), draw a segment linking a representative element of B u with a 
representative element of B v . For instance, the class C^(10, N), associated with the chain 
c\ appearing at Point 5 above, is represented by the diagram appearing in Fig. |3j whereas 
the chain c 2 is associated with the class Cj' 1 (4, 3), whose diagram is drawn in Fig. HJ 




Figure 3: a chain with three pairs of corresponding singletons 
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Figure 4: a chain with two corresponding blocks 



7. Fix k, N ^ 2 and let C C C(2k, N) be a generic subset of C{2k, N). Let q = 1, 2k 
be an integer. We say that C has at most q degrees of freedom (or, equivalently, that 
C has at most q free indices) if there exists two subsets D,E C [k] such that \D\ ^ 1 
and the following two properties are verified: (i) \D\ + \E\ ^ g, and (ii) for everjQ Xd = 
{x a '■ a G -D} G [iV]' 13 ' and every y# = {y& : 6 G -E} G [iV]'- 5 '', there exists at most one 
chain c as in ( I4.39P such that i a = x a for every a G D and j& = for every b & E. Note 
that our definition contemplates the possibility that E — 0, and in this case the role of 
2/£ = is immaterial. In other words, the class C has at most q degrees of freedom if every 
c G C is completely determined by those i a in the upper sub-chain such that a G D and 



4 As indicated by our notation, we regard xd and de as vectors, respectively in [7V]I D I and [iV]^, by 
endowing D and E with the natural ordering induced by the ordering on [A;]. 
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those jb m the lower sub-chain such that b G E. For instance, it is easily seen the class 
C(2k,N) has (exactly) 2k degrees of freedom. Another example is the diagram in Fig. 
[51 which corresponds to the case k = 6, 7r = {{1, 2}, {3, 5}, {4, 6}} and u — v — 1. One 
sees that, for every N, specifying ii, i± and ji completely identifies a chain inside the class 
Cl' 1 (12, N), which has therefore three degrees of freedom!! 




Figure 5: a class with three degrees of freedom 

The proof of the two (useful) results contained in the next statement is elementary and 
omitted. 

Lemma 4.3 Fix k, N ^ 2. 

(1) Let q — 1, 2k. Assume that a generic class C C C(2k, N) has at most q degrees of 
freedom. Then, \C\ ^ N q . 

(2) Let 1 = {[k]} be the one-block partition of [k]. Then, the class Ci(2k,N) contains 
only "constant" chains of the type \4-39$ such that (21,22) = (^cm Wi) = Ux,jx+i), 
for every a = 2, k and every x = 1, k. It follows that \Ci(2k, N) \ = N. 

Lemma [4.31 will be used in the subsequent section. 
4.3 Combinatorial upper bounds 

We keep the notation introduced in the previous section. The following statement, which 
is the key element for proving Proposition I4.1[ contains the main combinatorial estimate 
of the paper. 

Proposition 4.4 Fix k, N ^ 2, and let n = {Bi, B r } G V(k) be a partition containing 
at least one block of cardinality ^ 2. Let the class C n (2k,N) be defined as at Point 4. of 
the previous section. Then, there exists a finite constant Q(k,ir) ^ ; depending only on k 
and 7r (and not on N), such that 

\C n (2k,N)\ ^ 0{k,ir) x TV*- 1 . (4.40) 

Proof. We shall consider separately the two cases 

Indeed, one has necessarily that i\ — ii — 13 = 25 = ji = j'2 = 33 = J5) H = *6 and J4 = je- 
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A. For every v = 1, r, \B V \ ^ 2. 

B. The partition tt contains at least one singleton. 

Case A. When k = 2,3, the only partition meeting the needed requirements is 1. According 
to Lemma [4.31 (2). \Ci(2k, N)\ = N, so that the claim is proved, and we shall henceforth 
assume that k ^ 4. Start by observing that r ^ k/2. Moreover, the class C 7T (2k,N) 
contains only chains such that at least one block in the upper sub-chain corresponds to a 
block in the lower sub-chain, which yields in turn that 

r 

C n (2k,N)= |J Cr(2k,N), 

u,v=l 

where we adopted the notation introduced at Point 5. of Section 14.21 This implies the 
crude estimate 

r 

\C n (2k,N)\ <: \C^{2k,N)\. (4.41) 

u,v=l 

According to Lemma [4.31 (1). it is now sufficient to prove that each class C™' v (2k, N) has 
at most 2r — 1 degrees of freedom: indeed, (I4.4ip together with the fact that 2r — 1 ^ k — 1 
would imply relation (I4.40p . with Q(k, 7r) = r 2 ^ k 2 /4. Fix «,!)£ {1, r}. To prove that 
C% ,v (2k,N) has at most 2r — 1 degrees of freedom, we shall build two sets D,E C [k] as 
follows. For every s = 1, r, choose an element of the block B s , and denote this element 
by a s . Then, define 

D = {a s : s = 1, ...,r}, E = D\{a v }, 

where '\' denotes the difference between sets. We now claim that, for every xp = {x a : 
a G D} G [N]\ D \ and every y E = {y b : b G E} G [iV]^!, there exists at most one chain 
c G C% ,v (2k, N) as in (14. 39ft such that i a = x a for every a G D and % = y& for every b G E. 
To prove this fact, suppose that such a chain c exists, and assume that there exists another 
chain 



c = {i\ , i' 2 ) (i' 2 ,i' 3 )... (i' k , i[ ) (j[ , f 2 ) {j' 2 ,j' 3 )... (j' k , j[ ] 



verifying this property and such that d G C%' v (2k, N). The following hold: (a) for every 
s = 1, ...,r and every a G B s , one has that i' a = x ag = i as = i a , (b) for every s^t) and 
every a G B s , j' a = y as = j as = j a and (c) for s = v and every a G B v , 



3 a 3a v ^a u X a u *o„ ja v ja- 



As a consequence, d = c. Since \D\ + \E\ = 2r — 1, this concludes the proof of Proposition 
PI in the Case A. 
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Case B . We shall denote by S the collection of the singleton(s) of tt, that is the subset of 
[k] composed of those indices a such that {a} G tt. Note that \S\ > by assumption. We 
also write P for the collection of the indices u G [r] such that \B U \ ^ 2. Note that P is 
a subset of [r], whereas S C [&]. Note also that the set [r]\P is the collection of all those 
v G [r] such that B v is a singleton. Clearly, 

ii i i ^ 2 

By exploiting the cyclic nature of sub-chains, we can always assume, without loss of gener- 
ality, that 5* contains the singleton {k}. Since P is not empty, this entails that there exists 
at least one singleton of tt that is adjacent from the right to a block of cardinality at least 
two. Formally, this means that there exists s* G S and u* G P such that s* — 1 G B u * . We 
shall distinguish two cases 

Bl. The block B u * contains two consecutive integers. 

B2. The block B u * does not contain two consecutive integers. 
{Proof under Bl.) The situation of Bl is illustrated in Fig. [61 where k = 9, 

tt = {B u B 7 } = {{1}, {2}, {3, 6, 7}, {4}, {5}, {8}, {9}}, 
and one can take s* = 8, u* = 3, and the two consecutive integers in B u * are 6 and 7. 









<V0 




^> © 















^> 






Figure 6: a singleton is adjacent to a 3-block with two consecutive elements 

Since each element of C n (2k,N) is such that every singleton in a given sub-chain corre- 
sponds to a block in the opposite sub-chain, we have that 

C*(2k,N)= |J C?(2k,N), (4.42) 

where we adopted the same notation as at Point 5. of Section I4~2| and the union runs over 
the class TZ of all bijections R : U —tV such that both U and V contain the set [r]\P, and 
every pair (u, R{u)) is such that at least one of the two blocks B u and Br( u \ is a singleton. 
This entails the estimate 

\C„{2k,N)\ ^ ^|^(2fc,iV)|. (4.43) 
Ren 
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To conclude the proof, we shall show that every class C^(2k, N) appearing in ( I4.43P has 
at most k — 1 degrees of freedom: indeed, this fact together with Lemma |4~3T (1) yields the 
desired conclusion (I4.40|) . with the constant 0(/c,7r) = \7Z\ (note that the definition of TZ 
does not depend on N) . To prove that C^(2k, N) has at most k — 1 degrees of freedom, 
we define two sets D,E C [k] as follows. For every s — 1, ...,r, choose an element of the 
block B s , and denote this element by a s . Then, define 

D = {a s : s = 1, r}, E — D\ {{a u *} U{a s :sE [r]\P}} . 

In other words, E is obtained by subtracting from D the singleton(s) and the representative 
element of the block B u *, that is, of the block adjacent to {s*}. We now want to prove 
that, for every i D = {x a : a G D} G [iV]l D l and every y E = {y b : b G E} G [A^]I B I, there is 
at most one chain c G C^(2k, N) as in (14.39P such that i a = x a for every a G D and jb = Vb 
for every b G E. To show this, assume that such a chain c exists, and suppose that there 
exists another chain 

d = (^,^)(^,4)-(4,^)(jl,J2)(j2,J3)-(jLji) 

verifying this property and such that d G C^(2k, N) and d 7^ c. By construction of the 
sets D and E, all the indices composing the upper chain are completely determined by 
the choice of xjj, whereas the choice of y E determines the indices j x such that either x is 
a singleton or x G B v for some block B v of cardinality ^ 2 and such that ti / w*. This 
entails in turn that, necessarily since d 7^ c, one has that j' x 7^ j x for every x G B u *. This 
is absurd. Indeed, since B u * contains two consecutive integers, one has that j' x = j' x+1 and 
jx — jx+i f° r every x G B u *; it follows that, since {s*} is adjacent from the right to B u * 
and therefore s* — 1 G B u * , 

3 s* — 1 3 s* V s * js* 3 s* — 1 3x1 

which is indeed a contradiction. Since 

\D\ + \E\ = r + \P\ - 1 < + |5| + k — S - 1 = * - 1, 

— _ 

the proof is concluded. 

(Proof under B2.) Since 73 u » does not contain two consecutive integers and \B U * \ ^ 2, we 
deduce the existence of a block B^ G 7T, which is different from B u * and {s*}, enjoying the 
following "interlacement property": there exists an integer a G [k] such that a + 1 < s* — 1, 
a G and a + 1 G -B^. The block 5 5 can be either a singleton or a block with two or 
more elements. This situation is illustrated in Fig. [TJ corresponding to the case k = 8 
and n = {B 1 , B 5 } = {{1, 2}, {3, 5}, {4, 6}, {7}, {8}}. Here, s* = 7, B u » = £? 3 = {4, 6}, 
73„ = B 2 = {3, 5} and a = 4. 

The crucial remark is now that, for a chain c as in (I4.39P with partition tt, one has that 
i s * = i a +i- Indeed, a and s* — 1 both belong to B u *, and therefore (i a *_i, i a *) = (z a , i +i). 



29 




Figure 7: a singleton is adjacent to a 2-block with no consecutive elements 

Since a + 1 G B^, this fact yields in particular that, i x = i s * for every x G B^, that is, 
the left indices associated with are completely determined by the choice of i s * . By the 
same argument, one shows that j s * = j a +i- The rest of the proof is similar to the case 
Bl. First, we observe that the representation (14. 42 p . with TZ defined exactly as for Bl, 
continues to be true, from which we deduce the estimate (14 .43 p . It is now sufficient to show 
that each class C^(2k, N) has at most k — 1 degrees of freedom. To do this, one chooses a 
representative element from each block B s G it, noted a s , and then defines the sets 

D = {a s : s = 1, r, s ^ u}, E = D\{a s : s G [r] \P} , 

that is, D is built by selecting one element from each block of tt, except for B^, and E is 
obtained by subtracting from D all the remaining indices a such that {a} is a singleton of 
7i. One has that 

\D\ + \E\^k-l. (AAA) 

Indeed, \D\ = r — 1 = \P\ + \S\ — 1 ^ fc ~J^ + 1 S\ — 1 , and then one has to consider two cases: 
either (a) Bu is a singleton, from which it follows that \E\ = \D\ — (\S\ — 1) ^ , or (b) 
Bu is not a singleton, yielding \E\ = \D\ — \S\ ^ ~ !• 111 these two cases, (I4.44p is 
then in order. To conclude, it remains to show that, for every Xd = {x a : a G D} G [iV]' D ' 
and every yg = {y^ : b G E} G [iV]'' 6 '', there is at most one chain c G C^(2k,N) as in 
(I4.39P such that i a = x a for every a G D and jf, = for every b G E. To see this, assume 
that such a chain c exists, and observe that, due to the above considerations, the choice of 
Xd completely determines the upper sub-chain of c, as well as those indices j x in the lower 
sub-chain such that {x} is a singleton of tt or (whenever Bu is not a singleton) such that 
x G By;- Since the remaining left indices in the lower sub-chain of c are determined by the 
choice of the claim is proved. In view of (I4.44p . this shows that C^(2k, N) has at most 
k — 1 free indices. This concludes the proof of Proposition 14.41 

As an illustration of the above arguments, one can consider the diagram in Fig. |8l that is 
constructed from the situation in Fig. [7] by selecting U — V — {2,3,4,5} and R(2) = 4, 
-R(3) = 5, R(A) = 2 and R(5) = 3. In particular, it is easily seen that fixing z 4 , i 7 and ig 
completely identifies a chain c inside the class C^(16, N), that has therefore three degrees 
of freedom. 

□ 
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Figure 8: a class with three free indices 



4.4 Proofs of Proposition 14.11 and Theorem 11.11 

Proof of Proposition \4-l\ We take up the notation introduced in Section 14. 11 In view 
of Proposition 14. 4[ in order to prove relation (I4.38P (and therefore Proposition I4.ip . it 
is sufficient to show that, for every 7r G Q(k), each pair (i,j) G Gn{^) is such that 
the corresponding chain (ii, ^...{ik, 32)---{jki ji) is an element of C w (2k,N), from 

which one deduces iCr^C 71 ")! ^ \C n (2k, N)\ ^ Q(k,7r)N h ~ 1 . To show the desired prop- 
erty, it is enough to prove that, for every pair (i,j) G An(tt) x An (it) such that the 
chain i 2 )...(i k , ii)(jiih)-(3k,3i) is not in C w (2k,N), one has that (i,j) £ G N (n). 
By definition of C n (2k, N), we have to examine two cases. Start by considering a par- 
tition 7r G Q{k) not containing any singleton: if (i,j) G An(it) x A^(ti) is such that 
{h,h)---(ik,ii)(ju32)---(jk,ji) & Cn{2k,N), then the random variables X iaia+1 indexed by 
the upper sub-chain are independent of those indexed by the lower sub-chain, and conse- 
quently 

E[Xi x ^ . . . Xi^Xj^ . . . Xj k j 1 ) — EyXi x i 2 . . . Xi k i 1 )E( y Xj 1 j 2 . . . Xj k jA, 

yielding (i, j) ^ Gn(tt). On the other hand, if 7r G Q{k) contains a singleton and if (i, j) 
is such that (z 1; i 2 )...(ik, «i) (ji, J2 ) - • • ( J fc , ji) & C n (2k,N), then there exists a = l,...,k such 
that Xi a i a+1 or Xj a j a+1 is independent of all the other variables indexed by the elements of 
the chain. This gives 

E{Xi ± i 2 . . . Xi k i 1 Xj 1 j 2 . . . Xj k j 1 ) = Ei{Xi^i 2 . . . Xi k i 1 )E(Xj 1 j 2 . . . Xj k j ± ) = 0, 

thus proving the required property (i, j) (jL Gn(tt). The proof is finished. □ 

Proof of Theorem \l.lY (i): By virtue of the representation fll.12p - fll.13p and of Proposition 
14.11 one sees that, for every 2 ^ k\ < ... < k m , the limit in distribution of the vector 

(Tr(X N ),Tr(X%) - E [Tr(X%)] Tr(X^) - E [Tr(X»] ) 

coincides with the limit in distribution of 



( N \ 

N -l/2\^ Y N~^tX^Y Y Y \^ Y Y Y 

JV / j ^iii Jy / j yv iii2 yv i2t3 A !|; 1 !n ...,iv y\ nl2 y\ l2l3 A i tm ii 

so that the desired conclusion follows from Corollary 13.51 
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□ 

Proof of Theorem M.lY fii): For the simplicity of exposition, we assume that k\ ^ 2, the 
proof when k\ = 1 being completely similar and easier. We have, using the notation D]y 
introduced in the beginning of Section fl~3l and using f II . 1 3 [) . 

Tr(X%) - E[Tr{X%)) Tr(Xfr) - E[Tr(X^)) 







E 


H 



Var(Tr(X^)) 



Var(Tr(X^T)) 



-E 



where, by writing Var(Tr(X^)) = Cj(N), 
( 



A 



N 



E 



<P 



V 

i 2 ■ ■ ■ -<^-ifc, in ■ ■ ■ i 



\ 



Xi x i 2 . . . x ikmil 



(fern) 
JV 



/J 



and 



^ + -Bat, 



fci ^fc„ 
, . . . , 



B 



N 



E 



( 

L \ 



,1/2 ^ Xh 



i 2 ■ ■ ■ ^ik, n i ■ ■ ■ i 



c m (Ny/ 2 N- 



(km) 



-E 



Tr(X^) - E[Fr(X%)] Tr(X ^) - E[Ti(X^) ] 

Var(Tr(X^ m )) 



'Var(Tr(X^)) 

By combining Corollary 12.91 with the computations made in the proof of Proposition 13. 1\ 
we immediately get that Ajy = 0(iV _1//4 ). For B^, we can write 



\b n \ ^ Ky\\ OQ J2 E 



^ ] (^«ii2 • • • X-i k ,i x E\Xi r i 2 . . . jj) 



Var 



N~ 



\ 



} j {Xiiii ■ ■ ■ Xi k _i x E\X,i r i 2 . . . jj) . 



^D N J 



for some constant K not depending on N, so that B^ = 0(N 1 / 2 ) = 0(N x / 4 ) by Propc 
sition 14.11 



□ 
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5 Almost sure central limit theorems (ASCLTs) 
5.1 Preliminaries: a result by Ibragimov and Lifshits 



For x, y e R m (m ^ 1 fixed), we write (x, y) = X\y\ + . . . + x m y m (resp. |x| = y/ (x, x)) to 
indicate the inner product of x and y (resp. the norm of x). The following result, due to 
Ibragimov and Lifshits, plays a crucial role in the proof of Theorem 11.51 

Theorem 5.1 (See [19J) Let G = {G n : n ^ 1} be a sequence of -valued random 
variables converging in distribution towards a random variable Goo, and set 

N 



A N (G,t) = -i-V-fe^l _Me^ G ~>n t eR m . (5.45) 
log N ' n 

n=l 

If, for all r > 0, 



£/ien ; almost surely, for all continuous and bounded function if : R m — > R, we nave 

1 ^^)^ E M tf->oo. (5.47) 

log A* z — ' n 

n=l 

Remark 5.2 1. If i?|A7v(G, t)| 2 = 0(1/ log N) uniformly in t on bounded sets, then 
(I5.46P is automatically satisfied. 

2. See |S] for several applications of Theorem 15. II in the framework of ASCLTs on Wiener 
space. 

The following useful result allows to deal with sequences of random variables having 
the form of a sum of two terms, one of which vanishes in the mean-square sense. 

Lemma 5.3 Let G = {G n : n ^ 1} be a sequence of W 11 -valued random variables converg- 
ing in distribution towards a random variable G^, and satisfying in addition fl,5.^6| ). Let 
R = {R n : n ^ 1} be a sequence of R m -valued random variables converging in L 2 (Q) to 
Roo = 0, and such that 

oo N 1 

V Y -E\R n \ 2 < oo. (5.48) 

Then 

|2 



^ E\A N (G + R,t)\ 
sup ^ 



< oo, 



where G + R = {G n + R n : n ^ 1} and Atv(G + i?, £) zs defined according to 
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Remark 5.4 If E\R n \ 2 = 0(n a ), for some a > 0, then (I5.48P is automatically satisfied. 
Proof of Lemma^ Since A N (G + R,t) = J2n=i ±(e^*' G " +ii «> - £[e^*' G °°>]), one has that 

1 - 1 

A N (G + R,t) = A N (G, t) + — — J2 -z l{t ' Gn) ( ei{t ' Rn) ~ l) > 

log iv ^— ' n 

n=l 

so that, by using |x + y| 2 ^ 2|x| 2 + 2|?/| 2 , Jensen inequality and J2n=i ~ ~ log iV as N — > oo, 
there exists a constant c > (independent of N) such that, for all N ^ 2, 

£?|Ajv(G + i2, t)| 2 < 2E|A,v(G,t)| 2 + V -£| e ^> - l| 2 . 

log TV ' n 

n=l 

Since |e^*' x ^ — 1| ^ we deduce 

rW N 1 

E\A N {G + R, t)\ 2 < 2E|A^(G, t)| 2 + V -E\R n \ 2 . 

log A* ' n 

n=l 

The desired conclusion follows. ,_, 



5.2 Proof of Theorem 11.51 



For the sake of brevity, we shall prove Theorem 11.51 only for powers ki strictly greater than 
one. The general case (ki ^ 1) can be deduced from similar arguments. 

Throughout this section, we fix integers m ^ 1 and k m > . . . > k\ ^ 2. For N ^ 1 and k ^ 
2, we denote (as above) by the collection of all vectors i = (ii, . . . ,ik) € {1, • • • , N} k 
such that all pairs (i a , i a +i), a = 1, . . . , k, are different (with the convention that i^+i — ii), 
that is, i G D$ if and only if (i >*a+i) 7^ (*6>*6+i) f° r every a ^ b and 1 ^ i a ^ iV for 
every a = 1, k. Let 

J,v(fc) = iV- fc / 2 X ni2 X i2i 3 ■ ■ ■ *i k ii, and L N (k) ' ' X ' ^ ' 



Observe that i?[Jjy(A;)] = £7[Ljv(&)] = and Var[L^v(/c)] = 1. The proof of Theorem 11.51 is 
divided into several steps. 

Step 1: bounding E[L n (k)L p (k)}. Fix k ^ 2. We shall prove that there exists a constant 
Cfc > such that, for all n,p^ 1, 



E[L n (k)L p (k)] ^ C k J^^. (5.49) 
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(k) 

By symmetry, we assume without loss of generality that p ^ n. If i G D n and j G 

E[Xi x i 2 . . . Xi k i 1 Xj 1 j 2 . . . Xj k j^\ = 0, 

indeed, if i G Dn and j G Dp \ Z)i fc \ then necessarily there exists a = 1, k such that 
j a > n, and therefore the centered random variable Xj a j a+1 is independent of X ibib+1 for 

every b = 1, k, and also (by the definition of Dp) independent of Xj s j s+1 for every s ^ a. 
It follows that 



E[J n (k)J p (k)] = (±) E • • • X ^ X nn ■ ■ ■ Xju 

I \ fc/2 

— J EE ^[^<i*a • • • X i k h X hj2 ■■■ X : 
fc/2 



n 
P 

fe/2 



Thus S[L ft (jfe)L p (A;)] = (f) yj % n $$ y But we have E[J n (k) 2 } ->■ jfe as n ->■ cx), see 
indeed (I3.32p . As a consequence, we immediately get the existence of a constant C*. such 
that (I5.49P is in order. 

Step 2: showing that sup| f |^ r J2n=2 ^mlgN^ < 00 • ^ ^ 2. Let f kjN be as in (I3.24|) . 
Set g k , N = , 1 = fk,N- We obviously have Ljv(/c) = Q k (gk,N, X). Combining 03.321) and 

^/E[J;v(fc) 2 ] 

(I3.29p . we immediately get that, for all r = 1, . . . , k — 1, ||<7fc,jv *r <?fc,iv||2fc-2r — ©(A -1 / 2 ). 
From now on, for simplicity write Ln = (Ljv(fci), ■ ■ ■ , L^(k m )), N ^ 1, and = e - '*' / 2 , 
tGl m . Corollary EH yields that 

|£[ e i(W] _ ^) | = o(A~ 1/4 ). (5.50) 
On the other hand, for all r = 1, . . . , k — 1, we can write 

IK^fc.iV ~ <?fc,M) *r (<7fc,jV — gk,M)\\2k-2r 
— \\9k,N *r 9k,N + Qk,M *r 9k,M ~ 9k,N *r 9k,M ~ 9k,M *r 9k,N || 2fc-2r 
^ \\9k,N *r fi'fc,A r ||2fc-2r + \\9k,M *r 9k,M || 2fc-2r + 2||<7fc t jV *r 9k,M || 2fe-2r • 



But 



l^fe.JV *r 5'fc,M||2fc-2r 



\J (gk,N *k-r 9k,N, 9k,M *&— r 9k,M) 2r 



9k,N *k-r 9k,N\\2r\ \\9k,M *fc-r 9k,M\\2r 



^ -(\\gk,N *k-r gk,N\\2r + \\9k,M *k-r 9k,Al\W) ■ 
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Consequently, 



9k, N — 9k, M ^ 9k,N — 9k,M 



V2 



V2 



^ \\9k,N~ k r9k,N\\2k~2r+ || #fc,M *»* <?fc,M || 2fc-2r ~ 0(N 1 ^ 2 ), 



2fc-2r 



as iV -)■ oo, uniformly on M ) Af and r = l,...,k — 1, that is, there exists a constant 
Ck > (depending solely on k) such that, for every N, 



sup 

; Af>iV 



v/2 



v/2 



2fc-2r 



Since Var 



L N (k)-L M (k) 
V2 



1 — £ , [Ljv(fc)Z/M(fc)] for all k, by using Corollary 12.91 with 



Ql^X) = (L N {h) - L M {ki))/y/2 - 2E[L N {k i )L M {k i )\ 



and 



(p(zi, ...,z m )= exp (a/1 - E[L N (k 1 )L M (k 1 )]t 1 z 1 + . . . + y/l- E[L N (k m )L M (km)}tmZ m S j 
we get that 



E 



J t L N~ L M ' 

e \ ' , 



(l-£[M*OM*m)])y 

(5.51) 



is 0(iV x / 4 ) as iV — > oo, uniformly on M ^ iV. On the other hand, combining (I5.49P with 
|e~ x2//2 — e^ 1- ™^ / 2 | ^ ax 2 /2 for all x 6 R and a ^ 0, we get that there exists C r > 
such that, for all t E R m with \t\ ^ r, 

0(t) - exp (-(l - E[L N (ki)L M (k 1 )])^ - ... - (l - E[L iV (A; m )L M (A ;m )]) y) 
<; C,\l V. (5.52) 



iVAM 
iVVM' 

Define Ajy(L, t) according to (15. 45ft . with ~ ,yKn(0,I m ). For |t| ^ r, we have, due to 
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E\A N (L,t)\ 2 



log N np 

° n,p=l 
iV 



j4^E^ [(i^i-rti))-* (B( e '<«-») 

° n,p=l 



- 9 (t) (B(e- i <«' l '>) - 9 (*))] 
log ^F t £ [(b^'^)) - 9 (V2()) - 9 (t) (£(e*«) - 9 (t)) 



n.p=l 



iV 



-9 



(t) (J5(e-*<*^>) - <?(*))] ■ 



c r i / /nAp i i 

^ log 2 iV ^ np IV n V» + n 1 / 1 + p^ 

° n,p=l 



It is obvious that 

JV 



1 y- J_ / 1 _L 

log 2 N ^ np I nV4 + p i/4 / ~ V log 

° n,p=l x ' • ' 



Moreover, 

N / : ^9 iV 



O I — ) as N ->■ oo. 



1 1 /nAp < 2 ^y^ 1 y^ 1 

log 2 N ^ npynVp ^ log ^ nJri ^ x fp 

° n,p=l n=l v p=l v 



^ 1 
C \ - 1 

^ -^-z}- = 



log 2 N^n \\ogN 

o n=l v ' 

Hence, supm^ -E|Atv(£, t)| 2 = 0(l/logiV), implying immediately 



E AT WAT < °°- ( 5 " 53) 



I*l2^ 2 N] °gN 



S'tep 5: using Lemma l573i Set T^{k) = ^ • Using f)3.32p and elementary calculations, it 
is immediate that 



k(Vk ■ \/ 1 '. ./ v (7.-) 2 r ^ /,,2 A ' 2 
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so that i?7v := Tn — Ljv verifies condition (I5.48P of Lemma I5T31 Since ( I5.53P is also in order, 
we deduce that sup| t |^ r Y1'n=2 ^mlgN^ < 00 ' wn i cn m turns implies 

V E| ^ (J ^ )|2 < 00. (5.54) 

Step 4: using Lemma \5. 3\ once again. For any k ^ 2, set S^^k) = Ti(A^) — E [Tr(A^)]. 
We have 

N 

Sjv(A;) = (X ili2 Xi 2 i 3 ■ ■ ■ X ikil — E[Xi li2 Xi 2 i 3 ■ ■ ■ X ikil \) = JN{k) + -Rat(/c), 

ii,— ,ifc=l 

with 

RN{k) = N 2 ^ (X^^X^jg ■ • ■ X ifcil — i?[Xj li2 Jf i2 j 3 • ■ • X ikil ]). 

For all k ^ 2, we have E|i?jv(/c)| 2 = 0(1/N), see f !4.34p . Hence, Lemma ET3] together with 

f)5.54p imply that sup^^ Y^n=2 ^m^~N < 00 • ^° fi msn the P ro °f of Theorem 11.51 h 
suffices to apply Theorem 15.11 ^ 
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